AITopics

doi: 10.1145/3750720.3757282

2507.23018

Country: North America > United States > Tennessee > Anderson County > Oak Ridge (0.15)

Genre: Research Report (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Quality (0.94)

arXiv.org Artificial IntelligenceJun-26-2025

AIDRIN 2.0: A Framework to Assess Data Readiness for AI

Hiniduma, Kaveen, Ryan, Dylan, Byna, Suren, Bez, Jean Luca, Madduri, Ravi

AI Data Readiness Inspector (AIDRIN) is a framework to evaluate and improve data preparedness for AI applications. It addresses critical data readiness dimensions such as data quality, bias, fairness, and privacy. This paper details enhancements to AIDRIN by focusing on user interface improvements and integration with a privacy-preserving federated learning (PPFL) framework. By refining the UI and enabling smooth integration with decentralized AI pipelines, AIDRIN becomes more accessible and practical for users with varying technical expertise. Integrating with an existing PPFL framework ensures that data readiness and privacy are prioritized in federated learning environments. A case study involving a real-world dataset demonstrates AIDRIN's practical value in identifying data readiness issues that impact AI model performance.

aidrin, artificial intelligence, machine learning, (13 more...)

2505.18213

Genre: Research Report (0.50)

Industry:

Health & Medicine (1.00)
Information Technology > Security & Privacy (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Tiger, Mattias, Jakobsson, Daniel, Ynnerman, Anders, Heintz, Fredrik, Jönsson, Daniel

Exploratory Visual Analysis for Increasing Data Readiness in Artificial Intelligence Projects

arXiv.org Artificial IntelligenceSep-5-2024

We present experiences and lessons learned from increasing data readiness of heterogeneous data for artificial intelligence projects using visual analysis methods. Increasing the data readiness level involves understanding both the data as well as the context in which it is used, which are challenges well suitable to visual analysis. For this purpose, we contribute a mapping between data readiness aspects and visual analysis techniques suitable for different data types. We use the defined mapping to increase data readiness levels in use cases involving time-varying data, including numerical, categorical, and text. In addition to the mapping, we extend the data readiness concept to better take aspects of the task and solution into account and explicitly address distribution shifts during data collection time. We report on our experiences in using the presented visual analysis techniques to aid future artificial intelligence projects in raising the data readiness level.

data readiness, readiness, visualization, (16 more...)

2409.03805

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Sweden > Östergötland County > Linköping (0.04)
Europe > Germany (0.04)
(5 more...)

Genre: Research Report (0.50)

Industry: Transportation (0.46)

Technology:

Information Technology > Data Science > Data Quality (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Hiniduma, Kaveen, Byna, Suren, Bez, Jean Luca, Madduri, Ravi

AI Data Readiness Inspector (AIDRIN) for Quantitative Assessment of Data Readiness for AI

arXiv.org Artificial IntelligenceJun-27-2024

"Garbage In Garbage Out" is a universally agreed quote by computer scientists from various domains, including Artificial Intelligence (AI). As data is the fuel for AI, models trained on low-quality, biased data are often ineffective. Computer scientists who use AI invest a considerable amount of time and effort in preparing the data for AI. However, there are no standard methods or frameworks for assessing the "readiness" of data for AI. To provide a quantifiable assessment of the readiness of data for AI processes, we define parameters of AI data readiness and introduce AIDRIN (AI Data Readiness Inspector). AIDRIN is a framework covering a broad range of readiness dimensions available in the literature that aid in evaluating the readiness of data quantitatively and qualitatively. AIDRIN uses metrics in traditional data quality assessment such as completeness, outliers, and duplicates for data evaluation. Furthermore, AIDRIN uses metrics specific to assess data for AI, such as feature importance, feature correlations, class imbalance, fairness, privacy, and FAIR (Findability, Accessibility, Interoperability, and Reusability) principle compliance. AIDRIN provides visualizations and reports to assist data scientists in further investigating the readiness of data. The AIDRIN framework enhances the efficiency of the machine learning pipeline to make informed decisions on data readiness for AI applications.

aidrin, dataset, readiness, (12 more...)

2406.19256

Country:

Europe > France > Brittany > Ille-et-Vilaine > Rennes (0.05)
Europe > Germany > Bavaria > Regensburg (0.04)
North America > United States > Ohio > Franklin County > Columbus (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Energy (0.68)
Government (0.68)

Technology:

Information Technology > Data Science > Data Quality (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Hiniduma, Kaveen, Byna, Suren, Bez, Jean Luca

Data Readiness for AI: A 360-Degree Survey

arXiv.org Artificial IntelligenceApr-8-2024

Data are the critical fuel for Artificial Intelligence (AI) models. Poor quality data produces inaccurate and ineffective AI models that may lead to incorrect or unsafe use. Checking for data readiness is a crucial step in improving data quality. Numerous R&D efforts have been spent on improving data quality. However, standardized metrics for evaluating data readiness for use in AI training are still evolving. In this study, we perform a comprehensive survey of metrics used for verifying AI's data readiness. This survey examines more than 120 papers that are published by ACM Digital Library, IEEE Xplore, other reputable journals, and articles published on the web by prominent AI experts. This survey aims to propose a taxonomy of data readiness for AI (DRAI) metrics for structured and unstructured datasets. We anticipate that this taxonomy can lead to new standards for DRAI metrics that would be used for enhancing the quality and accuracy of AI training and inference.

application, data readiness, dataset, (12 more...)

2404.05779

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > New York > New York County > New York City (0.04)
Oceania > Australia > Queensland (0.04)
(10 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.34)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Education (1.00)
(2 more...)

Technology:

Information Technology > Data Science > Data Quality (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(4 more...)

#artificialintelligenceJun-8-2022, 18:55:46 GMT

Black Cape Awarded JAIC Basic Ordering Agreement for AI Data Readiness

ARLINGTON, Va., June 07, 2022 (GLOBE NEWSWIRE) -- Black Cape, Inc., an Arlington, Virginia-headquartered dual-use technology company, has been awarded a spot on the Joint Artificial Intelligence Center (JAIC) Data Readiness for Artificial Intelligence Development (DRAID) Basic Ordering Agreement (BOA). The DRAID Program is a potential five-year, $241.6 million award focused on enabling the Department of Defense (DoD) to optimize its vast data resources to leverage AI to enhance its mission effectiveness. The multi-award BOA includes a range of tasks needed to create, acquire, curate, prepare, and manage data for use in DOD artificial intelligence and machine learning models and application development, all areas where Black Cape maintains extensive experience in the national security and defense space. "We are honored to have been selected for this important effort to bring artificial intelligence enabled tools and applications to the JAIC," said Al Di Leonardo, Co-Founder and CEO of Black Cape. Black Cape technologies are used across Government, the Intelligence Community, DoD, and US Special Operations Command (SOCOM) to provide analytic services, artificial intelligence and machine learning capabilities.

awarded jaic basic ordering agreement, basic ordering agreement, black cape, (7 more...)

Country: North America > United States > Virginia > Arlington County > Arlington (0.58)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.80)

Olsson, Fredrik, Sahlgren, Magnus

We Need to Talk About Data: The Importance of Data Readiness in Natural Language Processing

arXiv.org Artificial IntelligenceOct-11-2021

In this paper, we identify the state of data as being an important reason for failure in applied Natural Language Processing (NLP) projects. We argue that there is a gap between academic research in NLP and its application to problems outside academia, and that this gap is rooted in poor mutual understanding between academic researchers and their non-academic peers who seek to apply research results to their operations. To foster transfer of research results from academia to non-academic settings, and the corresponding influx of requirements back to academia, we propose a method for improving the communication between researchers and external stakeholders regarding the accessibility, validity, and utility of data based on Data Readiness Levels \cite{lawrence2017data}. While still in its infancy, the method has been iterated on and applied in multiple innovation and research projects carried out with stakeholders in both the private and public sectors. Finally, we invite researchers and practitioners to share their experiences, and thus contributing to a body of work aimed at raising awareness of the importance of data readiness for NLP.

data readiness, objective, stakeholder, (11 more...)

2110.05464

Country:

North America > United States (0.28)
Europe > Sweden (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
(2 more...)

Genre: Research Report (0.50)

Industry:

Information Technology > Security & Privacy (0.93)
Health & Medicine (0.69)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

#artificialintelligenceApr-20-2021, 05:10:11 GMT

AWS Offers Course on Basics of Machine Learning - InformationWeek

On one hand, organizations recognize the potential value of machine learning to scale operations, gain faster and deeper insights, respond to quickly changing conditions, and more. On the other hand, it's hard to get started on something that is novel to your organization. You may not have the talent in-house, and you don't have any experience. What's more, even for those organizations that have run successful pilots, many have struggled to move those pilots into production for a variety of reasons. It feels like many organizations are stuck.

aw offer course, business problem, machine learning, (9 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.52)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.89)

#artificialintelligenceAug-1-2020, 10:30:38 GMT

Taking Matters into Your Own Hands

See also the article by Pan et al in this issue. Safwan S. Halabi, MD, is a clinical associate professor of radiology at the Stanford University School of Medicine and serves as the medical director for radiology informatics at Stanford Children's Health. Dr Halabi's clinical and administrative leadership roles are directed at improving quality of care, efficiency, and patient safety. His current academic and research interests include imaging informatics, deep/machine learning in imaging, artificial intelligence in medicine, clinical decision support, and patient-centric health care delivery. Bone age assessment became an early AI "poster child" that demonstrated the power of applying regression and machine learning techniques to a mundane and monotonous radiologic diagnostic task.

artificial intelligence, deep learning, machine learning, (16 more...)

Country: North America > United States > Ohio (0.05)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

#artificialintelligenceJul-16-2019, 02:16:10 GMT

Top firms to double number of AI projects by 2020: Gartner

Organisations that are working with artificial intelligence (AI) or machine learning (ML) have, on average, four projects utilising these technologies in place, according to a recent survey by Gartner. The survey finds 59 per cent of respondents have deployed AI. These respondents expect to add six more projects in the next 12 months, and another 15 within the next three years. This means that in 2022, those organisations expect to have an average of 35 AI or ML projects in place, says Gartner, in its AI and ML Development Strategies study . The analyst firm says the study is based on the results of a survey it conducted in December 2018 with 106 Gartner Research Circle members. The latter is a Gartner-managed panel composed of IT and IT/business professionals.

artificial intelligence, machine learning, social media, (11 more...)

Genre: Questionnaire & Opinion Survey (0.58)

Industry: Information Technology (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.57)
Information Technology > Communications > Social Media (0.52)